On the detection of the intelligibility advantage of clear speech vs. casual speech
نویسندگان
چکیده
Studies show that even though intelligibility is increased by the decrease of speech rate both in clear and casual speech, clear speech can also be produced without the decrease of rate after training the speakers [1]. This suggests that clear speech has inherent acoustic properties independent of rate, that contribute to improved intelligibility. Many speakers in their effort to elicit clear speech change their pitch both in level and range. However, it is not luminous if pitch modification is a feature that contributes to intelligibility. In this work, we examine if clear speech signals are still more comprehensive than casual speech signals after equalizing the prosody features on the two signals. To this purpose, a database of clear and casual speech signals is analyzed. Speakers in this database read sentences both in clear and casual way [2]. Clear speech sentences are modified in duration and pitch to match the corresponding attributes of casual speech signals. After the equalization, pilot acoustical test analysis and objective measure tests are performed on the four equal set of signals; on the initial database of clear and casual signals and additionally on the time-scaled and time and pitch-scaled clear signals. In the acoustical pilot experiments, speech shaped noise is added to the signals to create the test signals, with Signal to Noise Ratio of 0dB. Results show that on a set of pairs of clear and casual sentences, in 64% of the cases listeners found more intelligible the clear sentences. However, in time-scaled and time-pitch-scaled modified clear sentences intelligibility scores were deteriorated. Objective measure tests were also performed, using a modified version of the extended Speech Intelligibility Index (SII) [3]. SII was evaluated in a separate database giving high correlation scores with perceptual acoustical tests. According to the SSI measure, clear signals have higher intelligibility scores than casual signals (Fig.1(a)) with higher probability (Fig.1(b)) of identifying a sentence for SNR levels above −5dB. On the other hand, casual signals, time-scaled and timepitch-scaled clear signals that have the same duration, give the same score of SII independent of the SNR level (Fig.1(a)). Pilot acoustical experiments and objective measures suggest that duration indeed plays a significant role to intelligibility, whereas pitch modifications do not seem to contribute to intelligibility. −10 −8 −6 −4 −2 0 2 4 6 8 10 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
منابع مشابه
Intelligibility enhancement of casual speech for reverberant environments inspired by clear speech properties
Clear speech has been shown to have an intelligibility advantage over casual speech in noisy and reverberant environments. This work validates spectral and time domain modifications to increase the intelligibility of casual speech in reverberant environments by compensating particular differences between the two speaking styles. To compensate spectral differences, a frequency-domain filtering a...
متن کاملCan modified casual speech reach the intelligibility of clear speech?
Clear speech is a speaking style adopted by speakers in an attempt to maximize the clarity of their speech and is proven to be more intelligible than casual speech. This work focuses on modifying casual speech to sound as intelligible as clear speech. First, we examine the role of speaking rate for intelligibility. Clear and casual speech signals are time-scale stretched, matching the average d...
متن کاملمدل میکروسکوپی دوگوشی مبتنی بر فیلتر بانک مدولاسیون برای پیش گویی قابلیت فهم گفتار در افراد دارای شنوایی عادی
In this study, a binaural microscopic model for the prediction of speech intelligibility based on the modulation filter bank is introduced. So far, the spectral criteria such as the STI and SII or other analytical methods have been used in the binaural models to determine the binaural intelligibility. In the proposed model, unlike all models of binaural intelligibility prediction, an automatic ...
متن کاملبررسی وضوح گفتار کودکان فلج مغزی اسپاستیک 8 تا 12 ساله
Background and purpose: Speech intelligibility refers to how speech is understandable by listeners. This study examined speech intelligibility in children (Persian native speakers) with spastic cerebral palsy aged 8-12 years old. Materials and methods: A cross-sectional study was performed in 31dysarthric students (….. boys and …..girls) in Tehran, 2014. A list of w...
متن کاملSpeech difficulties in Joubert syndrome
Introduction: "Joubert syndrome" was first introduced in1969. This syndrome is a rare genetic disease with autosomal dominantpattern. Hypotonia, ataxia and motor delay of the disease known as clinical manifestations. In the few reports of this syndrome, mostly functional and structural components studied and radiographic images such as speech and language developmental delay symptoms has been l...
متن کامل